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DETAILED ACTION 
Remarks 

1. In response to communications filed on 07-November-2006, claims 1,18, and 22 are 
amended. Claims 1-31 are presently pending in the application, of which, claims 1, 8, 18, 
and 22 are presented in independent form. 

2. For the purpose of continued examination of this application, the Examiner takes Official 
notice that it is known in the art to use methods "similar to methods for identifying frequent 
itemsets in data", in discrete clustering of data. The Examiner cites the following US Patent 
Applications in support of the above Official Notice: 



Patent/Pub. No. 


Issued to 


Cited for teaching itemset identification in discrete 
clustering of data. 


US 2002/0049740 Al 


Aming et al. 


Paragraphs 7, 8, 9, and 26. 


US 2006/0026152 Al 


Zeng et al. 


Paragraphs 8, 9, 10, 33, and claim 4. 


US 6,138,177 


Bayardo 


Figures 3 and 5; column 6, lines 10-29; column 9, lines 
23-39; column 12, lines 56-65; and column 14, lines 6-35. 



Claim Rejections - 55 USC § 103 
3. The following is a quotation of 35 U.S. C. 103(a) which forms the basis for all obviousness 
rejections set forth in this Office action: 
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(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in section 
102 of this title, if the differences between the subject matter sought to be patented and the prior art are such that said 
subject matter as a whole would have been obvious at the time the invention was made to a person having ordinary skill 
in the art to which said subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 

4. Claims 1-6, 8-14, 16-26, and 28-31 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Favvad et al (PCT Pub No. WO 99/62007) in view of Kothurietal (U.S. Patent No. 
6,470,344 Bl), and further in view of Examiner's Official notice (see paragraph 4 of this 
Office Action for a Ust of cited references.) 

As to claim 1, Fawad et al. teaches a method for clustering data in a database 
comprising: 

a) providing a database having a number of data records having both discrete and 
continuous attributes (see page 7, lines 4-6); 

b) grouping together data records in a clustering model (see Abstract) from the database 
which have specified discrete attribute configurations (see page 8, lines 5 through page 9, 
lines 1-13; and see Table 1 and "Cluster Attribute A^alue Probability Tables"); 

c) clustering data records having the same or similar specified discrete attribute 
configuration based on the continuous attributes to produce an intermediate set of data 
clusters (see page 11, line 42 through page 12, line 32); and 

d) merging together clusters from the intermediate set of data clusters to produce a 
clustering model (see page 14, lines 26-28; and see figures 8A-8D). 

Fawad et al does not teach performing clustering in two phases, over a discrete attribute 
and using a method for clustering continuous attribute data, wherein the first phase precedes 
the second phase. 
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Kothuri et al teaches buffering hierarchical index of multi-dimensional data (see 
Abstract), in which he teaches clustering of data in two phases, over a discrete attribute and 
using a method for clustering continuous attribute data (see column 12, lines 40-54, and see 
column 14, lines 30-65), wherein the first phase precedes the second phase (it is inherent that 
in a two-phased clustering, what is known as the "first phase" occurs before the phase known 
as the "second phase". Therefore, it is inherent that the first phase precedes the second 
phase.) 

Therefore, it would have been obvious to a person having ordinary skill in the art at the 
time the invention was made to have modified Fawad et al by the teaching of Kothuri et aU 
because including clustering in two phases, over a discrete attribute and using a method for 
clustering continuous attribute data, wherein the first phase precedes the second phase, would 
enable the system to store different types of data, based on their attributes, into different 
clusters or groups (e.g. clustering data with attributes having discrete values, determining the 
number of positive values, and clustering data with attributes having continuous values 
(range of values), as taught by Kothuri et al (see column 14, lines 30-65.) 

Fawad et al. as modified still does not teach using an itemset identification. 

The Examiner is taking Official Notice that it is known in the art to use methods "similar 
to methods for identifying fi-equent itemsets in data", in discrete clustering of data. The 
Examiner directs the Applicant's attention to the references cited in paragraph 2 of this 
Office Action in view of the Examiner's Official notice. 
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As to claims 2, 9, and 23, Fa wad et al. as modified, teaches wherein the clustering model 
includes a table of probabilities for the discrete data attributes of the data records for a cluster 
and wherein the cluster model for continuous data attributes comprises a mean and a 
covariance for each cluster lines (see Fawad et al claim lb). 

As to claims 3, 14, and 24, Fawad et al. as modified, teaches wherein the process of 
merging of intermediate clusters is ended when a specified number of clusters has been 
formed (see Fawad et aK page 8, lines 12-14, where "specified number of clusters" is read 
on "initial cluster number K=3"; and see claim 14, where "specified number of clusters" is 
read on "K clusters"). 

As to claims 4 and 25, Fawad et al. as modified, teaches wherein the step of merging of 
intermediate clusters is ended when a distance between intermediate clusters is greater than a 
specified minimum distance (see Fawad et al page 27, line 12 through page 28, line 26, 
where "distance between intermediate clusters" is read on "stopping criteria" and "specified 
minimum distance" is read on "the sum of these two numbers" and "the sum of these 
numbers"). 

As to claims 5 and 26, Fawad et al. as modified, teaches wherein the discrete attributes 
are Boolean and similarity between configurations is based on a distance between bit patterns 
of the discrete attributes (see Fawad et ah page 33 where "Boolean" and "bit patterns" is 
read on "0/1 assignments"). 
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As to claims 6 and 20, Fawad et al. as modified, teaches wherein one or more of the 
discrete attributes have more than two possible values and comprising the step of subdividing 
a discrete attribute having more than two possible values into multiple Boolean value 
attributes (see Fawad et al page 33 where "Boolean" and "two possible values" is read on 
"0/1 assignments"). 

As to claim 8, Fawad et al. teaches a method for clustering data in a database 
comprising: 

a) providing a database having a number of data records having both discrete and 
continuous attributes (see page 14, line 32 through page 15, line 2); 

b) performing a first discrete cluster and identifying a first set of configurations wherein 
the number of data records of each configuration of the first set of configurations exceeds a 
threshold number of data records (see page 15, line 21 through page 16, line 15, where 
"counting data records" is read on "counting the number of data records" and "exceeds a 
threshold number of data records" is read on "stopping criteria"); 

c) adding data records fi-om the database not belonging to one of the first set of 
configurations with a configuration within the first set of configurations to produce a subset 
of records from the database belonging to configurations in the first set of configurations (see 
page 15, lines 12-18, where "subset of records" is read on "compressed data"); and 

d) clustering the subset of records contained within at least some of the first set of 
configurations based on the continuous data attributes of records contained within that first 
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set of configurations to produce a clustering model (see page 15, lines 19-27, where 
"continuous data attributes" is read on "ordered attributes"). 

For the teaching of "performing a first discrete clustering", and "performing a second 
continuous clustering", the applicant is directed to the remarks and discussions made in claim 
1 above in view of the teachings of Kothuri et al as well as the Examiner's Official notice. 

As to claim 10, Fa wad et al. as modified, teaches wherein an added record not contained 
within the first set of configurations is added to one of the first set of configurations based on 
a distance between a smaller configuration to which the added record belongs during 
counting of records in different configurations (see Fawad et al , page 15, line 24-25, where 
"counting" is read on "'M' counting"). 

As to claims 1 1 and 28, Fayyad et al. as modified, teaches wherein the clustering of 
records fi-om a configuration based on continuous data attributes results in a variable number 
of clusters for each configuration based on the number of records in the configuration (see 
Fayyad et al , page 15, fines 19-32, where "continuous data attributes" is read on "ordered 
attributes"; and where "variable number of clusters" is read on "scalable clustering process"). 

As to claim 12, Fayyad et al. as modified, teaches wherein the clustering of records fi'om 
records falling within a configuration of the first set results in a number of intermediate 
clusters which are merged together to form the cluster model (see Fayyad et al , page 18, lines 
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23-31, where "records falling with a configuration" is read on "data points falling within a 
given cluster"). 

As to claim 13, Fawad et aL as modified, teaches wherein intermediate clusters are 
merged together based on a distance between clusters that is determined based on both 
continuous and discrete attributes of the intermediate clusters (see Fawad et al page 4, line 
20 through page 5, line 4, where "clusters are merged" is read on "membership of a given 
record in a particular cluster"; and see page 19, lines 1-7, where "distance between clusters" 
is read on "sufficiently 'close' to an existing CS subcluster"). 

As to claims 16 and 29, Fawad et al. as modified, teaches wherein a list of records of 
each configuration in the first set of configurations is maintained as data records are accessed 
from the database (see Fawad et al page 8, lines 5 through page 9, lines 1-13; and see Table 
1 and "Cluster Attribute/Value Probabihty Tables"). 

As to claims 17 and 30, Fawad et al. as modified, teaches where the clustering based on 
the continuous attributes of records within a configuration is performed using expectation 
maximization clustering of the continuous attributes (see Fawad et al page 4, line 20 
through page 5, line 4). 
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As to claim 18, Fawad et al teaches a data processing system comprising: 

a) a storage medium for storing a database having a number of data records having both 
discrete and continuous attributes (see page 7, Unes 4-9); 

b) a computer for evaluating data records from the database and building a clustering 
model that describes data in the database (see page 7, lines 1-5); and 

c) a database management system including a component for selectively retrieving data 
records from the database for evaluation by the computer (see page 7, lines 9-11, where 
"retrieving data records" is read on "brings data from the database"); 

For the teaching of "performing a first discrete clustering", and "performing a second 
continuous clustering", the applicant is directed to the remarks and discussions made in claim 
1 above in view of the teachings of Kothuri et al as well as the Examiner's Official notice. 

As to claim 19, Fawad et al. as modified, teaches wherein the computer includes a rapid 
access storage for maintaining a list of data records from the database for data records having 
a specified discrete attribute configuration to facilitate clustering of the data records based on 
their continuous attributes (see Fawad et ah page 5, lines 5-8). 



As to claim 21, Fawad et al as modified, teaches wherein the rapid access storage of the 
computer includes a data structure for storing a clustering model (see Fawad et al figures 
8A-8D). 
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As to claim 22, Fawad et al. teaches a computer readable medium containing stored 
instructions for clustering data in a database comprising instructions for (see page 7, lines 1- 
11): 

a) reading records from a database having a number of data records having both discrete 
and continuous attributes (see page 7, Unes 4-11, where "reading records" is read on "brings 
data from the database"); 

For the teaching of "performing a first discrete clustering", and "performing a second 
continuous clustering", the applicant is directed to the remarks and discussions made in claim 
1 above in view of the teachings of Kothuri et al as well as the Examiner's Official notice. 

As to claim 31, Fawad et al. as modified, teaches where records are assigned to a single 
cluster during the expectation maximization clustering process (see Fawad et al page 4, 
lines 26-31; and see claim 24). 

Allowable Subject Matter 

5. Claim 7, 15, and 27 are objected to as being dependent upon a rejected fiase claim, but would 
be allowable if rewritten in independent form including all of the limitations of the base 
claim and any intervening claims. 
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Response to Arguments 

6. The Applicant's arguments regarding the previously made rejection under the first paragraph 
of 35 U.S.C. 1 12, the arguments are fully considered and are deemed persuasive. The 
rejection is therefore, withdrawn. 

7. AppUcant's arguments filed on 07-November-2006 with respect to the rejected claims in 
view of the cited references have been fully considered but they are not deemed persuasive: 

Regarding the Applicant's arguments that, "nowhere does Kothuri teach the claimed two 
phases including a first phase which clusters the data record over a discrete attribute space, 
and a second phase that then uses a method of clustering continuous attributes using a 
method for clustering continuous attribute data to produce and intermediate set of data 
clusters, where the first phase precedes the second phase since it provides an input that is 
used in the second phase", the arguments have been fully considered but they are not deemed 
persuasive. As detailed in the rejection of claim 1, Kothuri et al teaches buffering 
hierarchical index of multi-dimensional data, in which he teaches clustering of data in two 
phases, over a discrete attribute and using a method for clustering continuous attribute data 
(see column 12, lines 40-54, and see column 14, lines 30-65.) As to "wherein the first phase 
precedes the second phase", it is inherent that in a two-phased clustering, what is knovra as 
the "first phase" occurs before the phase known as the "second phase". Therefore, it is 
inherent that the first phase precedes the second phase. 
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Regarding the Applicant's comments on the "Finality" of the previous Office Action, the 
Examiner's intention was to make the previous Office Action "Final", as stated in the 
conclusion section of the previous Office Action. However, the Examiner accidentally 
marked the wrong check-box in the Office Action Summary sheet as "Non-Final". In view 
of the Examiner's error and to clear the confiision, the Examiner makes the assumption that 
the previous Office Action was "Non-Final". The Examiner marks this Office Action as 
"Final". 

Conclusion 

8. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time policy 
as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS fi'om the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until 
after the end of the THREE-MONTH shortened statutory period, then the shortened statutory 
period will expire on the date the advisory action is mailed, and any extension fee pursuant to 
37 CFR 1.136(a) will be calculated fi-om the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS firom the date of 
this final action. 
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9. Any inquiries concerning this communication or earlier communications from the examiner 
should be directed to Tony Mahmoudi whose telephone number is (571) 272-4078. The 
examiner can normally be reached on Mondays-Fridays from 08:00 am to 04:30 pm. 

If attempts to reach the examiner by telephone are unsuccessfiil, the examiner's 
supervisor, Jeffrey Gaffin, can be reached at (571) 272-4146. ^ a /I /I ^/ / 
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tm 




